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PROTEIN 



This invention relates to newly identified polynucleotides, polypeptides encoded by 
these polynucleotides, to the prodiiciion of such polynucleotides and polypeptides, and to the 
5 uses of such polynucleotides and polypeptides. More specifically,, the invention relates to the 
phosphonievalonate kinase (PMK) eene (ERGS gene) from Candida albicans (C albicans). 
to methods for its expression yielding phosphomevalonate kinase protein, to novel hybrid 
organisms for use in such expression methods, to methods for purification of the protein, to 
methods and tools for diagnosing C albicans infection and to assays for identifying inhibitors 

10 of the enzyme which inhibitors have potential as anti-fungal agents. 

C albicans is the most important human fungal pathogen and the most prominent 
target organism for antifungal research. PMK is an enzyme required for the biosynthesis of 
isoprene subunits that are used as precursors in the synthesis of sterols, dolichols and 
ubiquinones. As PMK is an essential biosynthetic enzyme, inhibitors of PMK should find use 

15 as antifungal agents. All species synthesise a protein with PMK activity however, across 
species the enzymes differ considerably in their amino acid sequence. Because of selectivity 
problems (for example fungal versus human) it is extremely important to optimise potential 
inhibitors specifically against the fungal target enzynaes (i.e. C albicans or Aspergillus 
fumigaius) and not against the human enzyme. Such cross-fungal-species inhibitors possess 

20 broad specificity. Altematively, it may be desirable to use an inhibitor which is more 
selective, for example, one that inhibits C. albicans PMK but not a homologous but non- 
identical fungal PMK protein such as fi-om Saccharomyces cerevisiae (S. cerevisiae). 

In view of the increased incidence of fungal resistance to existing anti-fiingal agents 
and fuelled by the growing number of fungal infections particularly in people with 

25 immunodeficiency disorders, organ transplants and cancer, there is a need for new means of 
identifying potential anti-fungal agents. 

We have now successfully cloned the ERGS gene fi*om C. albicans (hereinafter 
referred to as ERGS gene) and determined its full length nucleotide sequence and 
corresponding (PMK) polypeptide sequence (hereinafter referred to as ERGS protein) as set 

30 out in Figures 3 and 4 of this application respectively. The coding DNA sequence (SEQ ED 
NO. 6) of the C. albicans ERGS gene isolated is 1299 nucleotides in length and the 
corresponding protein sequence is 433 amino acids in length (SEQ ID NO. 7). The protein 
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exhibits approximately 45% homology with the corresponding protein from S. cerevisiae and 
only about 1 0% homology to that of the human protein equivalent. Homology as used herein, 
takes the definition known to and routinely used by molecular biologists. It refers to the 
sequence identity between two sequences as assessed by best-fit computer alignment analysis 
f using suitable software such as Blast, Blast2, fastA, Fasta3 and PILEUP. Preferred software 
for use in assessing the percent homology, i.e how two primary polypeptide sequences line up 
is PILEUP. Identity refers to direct matches. 

According to a first aspect of the invention there is provided an isolated polypeptide 
which is ERGS protein, as well as variants thereof The preferred polypeptide sequence is that 

1 0 as set out in Figure 4 (SEQ ID NO. 7). The complete C albicans phosphomevalonate kinase 
enzyme polypeptide has the amino acid sequence as depicted in Figure 4 herein. The 
polypeptides of the present invention include the polypeptide of Figure 4 as well as 
polypeptides which have in increasing order of preference, at least 75%, 80%, 85%, 90%, 
95%, 96%, 97%, 98%, and 99% homology (interchangeably termed identity herein) to the 

15 polypeptide whose amino acid sequence is depicted in SEQ ID NO. 7. 

Although the natural polypeptide of SEQ ID NO. 7 and a variant polypeptide may 
only possess for example 80% identity, they are actually likely to possess a higher degree of 
similarity, depending on the number of dissimilar codons that are conservative changes. 
Similarity between two sequences includes direct matches as well as conserved amino acid 

20 substitutes which possess similar structural or chemical properties, e.g. similar charge. 

Examples of conservative changes (conserved amino acid substitutes) are inter alia: alanine to 
glycine, isoleucine, valine or leucine; tyrosine to phenylalanine or tryptophan; and lysine to 
arginine or histidine. 

The invention also relates to variant polypeptide sequences encoded by nucleic acid 
25 capable of hybridising with nucleic acid coding for the natural polypeptide (SEQ ID No. 6, or 
its complementary antisense strand)(or would do so but for the degeneracy of the genetic 
code), for example under stringent conditions (such as at 35°C to 65^0 in a salt solution of 
approximately 0.9M). Such hybridisable polynucleotides are also part of the invention. The 
present invention particularly relates to polynucleotides which hybridise to the ERGS 
30 polynucleotide sequence depicted in SEQ ID NO. 6, its complementary sequence, or fragment 
thereof, under stringent conditions. As used herein, stringent conditions are those conditions 
which enable sequences that possess at least 80%, preferably at least 90% and more preferably 
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at least 95% sequence identity to hybridise together. Thus, nucleic acids which can hybridise 
to the nucleic acid of SEO ID No. 6, or the complementar\' antisense strand thereof, include 
nucleic acids which have at least 80%, preferably at least 90%, more preferably at least 95%, 
still more preferably at least 98% sequence identity and most preferably 100%. over at least a 
: ponion of the nucleic acid encoding the ERGS' gene disclosed herein. As well as fiill-lenpth 
pene sequences, smaller nucleic acid fragments for e>:ample oligonucleotide primers which 
can be used to amplify the ERGS gene using any of the well known amplification systems 
such as polymerase chain reaction (PGR), or fragments that can be used as diagnostic probes 
to identify corresponding nucleic acid sequences are also part of this invention. The invention 
0 thus includes polynucleotides of shorter length than the full length ERGS gene sequence 
depicted in SEQ ID No. 6, that are capable of specifically hybridising to the nucleic acid 
encoding the C. albicans ERGS gene described herein. Such polynucleotides may be at least 
10 nucleotides in length, preferably at least 15, more preferably at least 20 and most 
preferably at least 30 nucleotides in length and may be of any size up to and including the fiill 
5 length ERGS nucleotide sequence. 

An example of a suitable hybridisation solution when a nucleic acid is immobilised on 
a nylon membrane and the probe nucleic acid is greater than 500 bases or base pairs is: 6 x 
SSC (saline sodium citrate), 0.5%> SDS (sodium dodecyl sulphate), lOO^g/ml denatured, 
sonicated salmon sperm DNA. The hybridisation being performed at 68°C for at least 1 hour 
and the filters then washed at 68°C in 1 x SSC, or for higher stringency, 0.1 x SSC/0.1%) 
SDS. 

An example of a suitable hybridisation solution when a nucleic acid is immobilised on 
a nylon membrane and the probe is an oligonucleotide of between 12 and 50 bases is: 3M 
trimethylammonium chloride (TMACl), 0.01 M sodium phosphate (pH 6.8), IraM EDTA (pH 
7,6) , 0.5% SDS,100)ig/ml denatured, sonicated salmon sperm DNA and 0.1 dried skimmed 
milk. The optimal hybridisation temperature (Tm) is usually chosen to be 5°C below the Ti of 
the hybrid chain. Ti is the irreversible melting temperature of the hybrid formed between the 
probe and its target. If there are any mismatches between the probe and the target, the Tm 
will be lower. As a general guide, the recommended hybridisation temperature for 17-mers in 
3M TMACl is 48-50°C; for 19-mers,it is 55-57°C; and for 20-mers, it is 58-66°C. 

A suitable hybridisation protocol is described in Example 5 herein, however, operable 
variations to this method will be apparent to the person skilled in the art. 
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Fragments of such polypeptides comprising at least 30 amino acids and preferably at 
least 50 amino acids are also encompassed by the pres.ent invention. Such fragments may be 
used as intennediaies to generate longer polypeptide fragments including preferably, the full- 
length polypeptide sequence as depicted in Figure 4, or a functional variant thereof. Such 
: polypeptide iragments may also be used to raise antibodies against parts of the ERGS protein. 

As used herein, the term 'variant' includes naturally occurring allelic variants as well 
as non-naturally occurring variants, fragments and analogs of the sequences depicted in SEQ 
ID NOs. 6 or 7. Such x'ariants include C- or N-truncated variants, deletion variants, 
substitution variants as well as addition and mserlion variants. The term 'analog' refers to 

10 proproteins which can be activated by cleavage of the proprotein ponion to release the 
biologically active polypeptide or protein. The term 'derivative' refers to a polypeptide 
encoded by a chemically modified ERGS gene, for example one wherein hydrogen has been 
replaced by an acy] or amino group, as well as polypeptides possessing one or more non- 
natural amino acids. When referring to a polypeptide or protein sequence, a functional variant 

15 is one that has retained at least some PMK enzymatic activity. The variant polypeptides of 
the present invention may comprise internal, but preferably, terminal flanking sequences 
(fusion proteins) to facilitate protein purification. Such 'additional domain' sequences (Flag 
sequences) may comprise for example, metal chelating peptides such as histidine-tryptophan 
modules (including 6-his tags) that allow purification of the polypeptide on immobilised 

20 metals, protein A domains that allow purification on immobilised immunoglobulin, or peptide 
domains that allow purification on immobilised antibodies specific for the peptide. Other 
suitable 'additional purification domains' will be known to the person skilled in the art. 

According to a preferred embodiment of the invention the native ERGS polypeptide 
sequence (having the sequence as depicted in SEQ ID No. 7) is fused at its amino terminus to 

25 six histidine residues which serve to enable the polypeptide, once expressed fi-om the host 
cell, to be isolated and purified by affinity chromatography using a Ni-chelate resin. 

A flanking purification domain may be separated fi-om the ERGS polypeptide by a 
cleavage sequence such as that recognised by thrombin or Factor Xa so as to facilitate release 
of the polypeptide from the flanking sequence which may or may not be attached to an 

30 immobilised support. Alternatively, cyanogen bromide which cleaves at methionine residues 
can be employed to release the desired polypeptide fi-om its flanking sequence. 
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The polypeptides of the invention can be synthesised chemically. For example, by the 
Mern'field technique (J. Amer. Chem. Soc. 85:2149-2154, 1968). Numerous automated 
polypeptide synthesisers, such as Applied Biosystems 431 A Peptide Synthesizer also now 
exisi. Alternatively, and preferably, the polypeptides of the invention are produced from a 
5 nucleotide sequence ericodinp the pclypeptide usuvp reconibmani expression techrjoio<?*v . 
In a funher aspect of the invention there are provided isolated polynucleotides (including 
genomic DNA. genomic RNA, cDNA and mRNA; double stranded as well as -rve and -ve 
strands) which encode the polypeptides of the invention. Single stranded DNA molecules of 
all or pan of the ERGS gene either -fve or -ve strand, find use inter alia, as hybridisation 

10 probes or PCR amplification primers. The sense strand of the complete gene sequence of 
native ERGS is depicted in Figure 3 (SEQ ID No. 6) hereinafter. It will be appreciated that a 
polynucleotide of the invention may comprise any of the degenerate codes for a particular 
amino acid, including the use of rare codons. Indeed, when producing the polypeptide by 
recombinant expression in heterologous host strains, it may be desirable to adopt the codon 

15 useage (preference) of the host organism (Murray. N.A.R. 17:477-508, 1989). 

The invention further comprises convenient fragments of any one of the above 
sequences. Convenient fragments may be defined by restriction endonuclease digests of 
nucleic acid comprising the ERGS gene sequence. Such fragments are useful inter alia, for 
expressing short polypeptides fragments of ERGS protein of the invention as well as for use 

20 as hybridisation probes. The present invention also provides a polynucleotide probe 

comprising any one of the above sequences or fragments together with a convenient label or 
marker, preferably a non-radioactive label or marker. Following procedures well known in 
the art, the probes can be used to identify and isolate not only corresponding nucleic acid 
sequences (i.e C. albicans ERGS gene sequences) but, if sufficiently homologous, can also be 

25 used to identify the analogous gene from other organisms using techniques well knovsoi to the 
person skilled in the art. Such sequences may be comprised in libraries, such as genomic or 
cDNA libraries. The present invention also provides RNA transcripts corresponding to any of 
the above C, albicans ERGS sequences or fragments. RNA transcripts can be used to prepare 
a polypeptide of the invention by in vitro translation techniques according to known methods 

30 (Sambrook et al "Molecular Cloning- A Laboratory Manual, second edition 1 989"). The 
invention further comprises full-length or fragment lengths of ERGS gene (coding sequence) 
flanked by non-coding sequence which may include natural or non-natural sequence 
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containing restriction enzyme recognition sequence motifs. The incorporation of suitable 
restriction enzyme recognition sites either side of the ERGS coding region, or indeed any 
polynucleotide sequence from ERGS, facilitates cloning of the ERGS gene or polynucleotide 
sequence into a suitable vectoi. A suitable polynucleotide comprises a full length C albicans 
^ ERGS gene (encoding the polypeptide that starts with methionine at position 1 and terminales 
with the leucine that precedes the stop codon TAA at position 1299 of Figure 3) flanked by 
unique Hindll] (5'-end)-Xho] r3 '-end) restriction sites. Examples of oligonucleotide primers 
which are suitable for use in PGR amplification of ERGS, and which incorporate useful 
restriction enzyme sites to facilitate cloning, are detailed in Figure 6. Nucleotide changes or 
10 mutations may be introduced into a polynucleotide sequence by de novo polynucleotide 

synthesis, by site directed mutagenesis using appropriately designed oligonucleotide primers 
or by any other convenient means know to the person skilled in the art. 

For expression purposes, it may be advantageous to engineer a restriction site at the 5'- 
end which is also capable of reconstituting the native amino-terminal methionine of the 
15 protein. The cleavage recognition sequence for the Ncol restriction enzyme not only includes 
a sequence that codes for methionine, but also one that is capable of retaining a functional 
Kozak consensus sequence, enabling the ERGS gene to be cloned at the 3 '-end of a suitable 
promoter element in an expression vector. 

The DNA polynucleotides can be synthesised chemically, or isolated by one of several 
20 approaches known to the person skilled in the art such as polymerase chain reaction (PCR) or 
ligase chain reaction (LCR) or by cloning from a genomic or cDNA library. 

Once isolated or synthesised, a variety of expression vector/host systems may be used 
to express ERGS coding sequences. These include, but are not limited to microorganisms 
such as bacteria expressed with plasmids, cosmids or bacteriophage; yeasts tranformed with 
25 expression vectors; insect cell systems transfected with baculovirus expression systems; plant 
cell systems transfected with plant virus expression systems, such as cauliflower mosaic virus; 
or mammalian cell systems (for example those transfected with adenoviral vectors); selection 
of the most appropriate system is a matter of choice. 

Expression vectors usually include an origin of replication, a promoter, a translation 
30 initiation site, optionally a signal peptide, a polyadenylation site, and a transcription 

termination site. These vectors also usually contain one or more antibiotic resistance marker 
gene(s) for selection. As noted above, suitable expression vectors may be plasmids, cosmids 
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or viruses such as phage or retroviruses. The coding sequence of the polypeptide is placed 
under the control of an appropriate promoter, control elements and transcription terminator so 
that the nucleic acid sequence encoding the polypeptide is transcribed into RNA in the host 
cell transformed or iransfected by the expression vector construct. The coding sequence ma\ 
5 or nia\' not ccntairi a signal peptide or leader sequence for secretion of the polypeptide out of 
the host cell. Expression and purification of the polypeptides of the invention can be easilv 
performed using methods well known in the art (for example as described in Sambrook et al. 
"Molecular Cloning- A Laborator\' Manual, second edition 1989"). 

The vectors containing the DNA coding for the ERGS polypeptides of the invention 

1 0 can be introduced (i.e transformed or transfected) into £". coli^ S, cerevisiae, Pichia pastoris or 
any other suitable host to facilitate their manipulation (i.e. for mutagenesis, cloning or 
expression). Performance of the invention is neither dependent on nor limited to any 
particular strain of host cell or vector; those suitable for use in the invention will be apparent 
to, and a matter of choice for, the person skilled in the art. 

15 Host cells transformed or transfected with a vector containing an ERGS nucleotide 

sequence may be cultured under conditions suitable for the expression and recovery of the 
encoded proteins from the cell culture. Such expressed proteins/polypeptides may be secreted 
into the culture medium or they may be contained intracellularly depending on the sequences 
used, i.e. whether or not suitable secretion signal sequences were present. 

20 The full-length native isolated C. albicans ERGS protein (PMK enzyme) of the 

present invention, or a functional variant thereof, is useful as a target in biochemical assays, 
particularly for use in identifying inhibitors of the enzyme. However, to provide sufficient 
enzyme for a biochemical assays (for example; for use in a high throughput screen for enzyme 
inhibitors) the enzyme has to be expressed at high levels and it has to be purified. Two major 

25 constraints impair ERGS expression and purification: (i) ERGS is not expressed at high levels 
from C albicans, and (ii) expression and protein purification methodology is not well 
advanced for C albicans. 

We have now been able to overcome these problems by controlled over-expression of 
the C albicans ERGS in a strain of Saccharomyces cerevisiae. S. cerevisiae is a model 

30 system for expression and purification of recombinant proteins. Use of S. cerevisiae to 
express C. albica?7s ERGS means that transformation, expression and purification 
methodology used to produce and isolate the ERGS protein can follow published procedures. 
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As stated above, the invention is not limited to use of S. cerevisiae as the host for expression 
of C albicans ERGS. 

According to a further aspect of the invention there is provided a host cell adapted to 
express C. albicans ERGS polypeptide or a variant thereof. The yeast S. cerevisiae is the 
r prefen ed host cell of choice. According to a funher aspect of the invention there is provided a 
novel expression system for expression of the C albicans ERGS gene, which system 
comprises an 5. cerevisiae host strain having the C. albicans ERGS gene in place of the native 
ERGS gene from S. cerevisiae. whereby the C albicans ERGS gene is expressed. Preferred S. 
cerevisiae strains include JK9-3Daa and its haploid segregants. 

1 0 The C albicans ERGS gene is preferably over-expressed relative to the expression 

derived from its own promoter. This is conveniently achieved by replacing the C. albicans 
ERGS promoter by a stronger and preferably inducible promoter such as the 5. cerevisiae 
GALl promoter, alpha factor, alcohol oxidase (for reviews see Ausubel et al. "Current 
Protocols in Molecular Biology", John Wiley & Sons, New York.). 

1 5 The novel expression system is conveniently prepared by transformation of a 

heterozygous ERGS deletion strain of a convenient S. cerevisiae host by a suitable plasmid 
comprising the C. albicans ERGS gene using methods well known in the art (Ito et al. J. 
Bacteriol. 153:163-168,1983; SchiestI and Grietz, Current Genetics 16:339-346,1989). 
The plasmid comprising the C. albicans ERGS represents a further aspect of the 

20 invention. Particularly suitable plasmids for expression of C. albicans ERGS in S. cerevisiae 
include YEp24, pRS316, pYES2(Invitrogen) and plasmids derived from pYES2 carying a 
native S. cerevisiae promoter such as the glyceraldehyde-3-dehydrogenase promoter. 

The heterozygous ERGS deletion strain of a diploid S. cerevisiae host is conveniently 
achieved by disruption preferably using an antibiotic resistance cassette such as the 

25 kanamycin resistance cassette described by Wach et al (Yeast. 10:1793-1808, 1994). 

As described earlier, the C albicans ERGS enzyme may be used in biochemical assays 
to identify agents which modulate the activity of the enzyme. The design and implementation 
of such assays will be evident to the biochemist of ordinary skill. The enzyme may be used to 
turn over a convenient substrate whilst incorporating/losing a labelled component to define a 

30 test system. Test compounds are introduced into the test system and measurements made to 
determine their effect on enzyme activity. Such assays are useful to identify inhibitors of the 
enzyme which may then prove valuable as antifungal agents. 
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Thus, in a further aspect of the invention we provide the use of a C. albicans ERG-8 
gene and/or C. albicans PMK enzyme in an assay to identify inhibitors of the enzyme. In 
particular, we provide their use in pharmaceutical or agrochemical research. 

The PMK TERGS) protein catalyses the conversion of phosphomevalonate -f ATP to 
r pyrcphosphonjevajonau - By way of non-liiTiitirjp example, the aciivitv of the ERGb 

enzyme may be determined by (i) measuring the increase in ADP production, (ii) by following 
the loss of ATP, or (iii) by monitoring transfer of radioactive label (i.e H% P^-) into 
phosphomevalonate. 

A suitable assay that measures ADP production involves coupling the ADP produced 

1 0 by the action of PMK on phosphomevalonate -f ATP substrate with pyruvate kinase and 

phosphoenolpyruvate to form pyruvate and ATP. The pyruvate is then reduced to lactate with 
lactate dehydrogenase which converts NADH to NAD. The production of NAD (directly 
linked to ADP production indicative of PMK action) is conveniently measured by detecting 
the change in absorbance at 340nm (NADH oxidation product). In this assay, test compoimds 

1 5 that inhibit PMK activity are identified by determining the ability of a compound to inhibit 
PMK activity as assessed by a reduction in ADP production as gauged by a reduction in the 
production of NAD from NADH using pyruvate kinase and lactate dehydrogenase as coupling 
enzymes as described above. The person skilled in the an would be able to develop other 
assays for measuring PMK activity without inventive input. 

20 ATP can be conveniently assayed using commercially available kits (i.e Boehringer 

Mannheim) to monitor luminescence resulting from oxidation of luciferin to luciferase (Ford 
et al. J. Biolumin. Chemilumin. 11:149-167, 1996). 

A suitable reaction that measures the production of radioactively labelled 
phosphomevalonate involves incubation PMK enzyme with cofactors, substrate ATP and 

25 phosphomevalonate, one of which carries a radioactive label. After reaction, 

pyrophosphomevalonate can be resolved from unreacted substrate by high voltage 
electrophoresis at pH3.5 on 3MM paper and the amount of radioactivity incorporated into 
pyrophosphomevalonate can be measured by scintillation counting (Lee andO' Sullivan. J. 
Biol. Chem. 260:13909-13915, 1985). 

30 Any convenient test compound or library of test compounds may be used in 

conjunction with the test assay. Particular test compounds include low molecular weight 
chemical compounds (preferably with a molecular weight less than 1500 daltons) suitable as 
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pharmaceutical or veterinary agents for human or animal use, or compounds for non- 
administered use such as cleaning/sterilising agents or for agricuhural use. 

The ERGS enzyme of the invention, and convenient fragments thereof may be used to 
raise antibodies:. Such antibodies have a number of uses which will be evident to the 
f molecular biologist or immunologist of ordinary' skill. Such uses include, but are not limited 
to, monitoring enzyme expression, development of assays to measure enzyme activity, 
precipitation or purification of the enzyme and as a diagnostic tool to detect C albicans. 
Enzyme linked immunosorbant assays (ELISAs ) are well known m the an and would be 
particularly suitable for detecting the ERGS polypeptide or fragments thereof. Antibodies 

10 raised against the polypeptides of the invention may be polyclonal, obtained for example by 
injecting the polypeptide(s) into a selected mammal (i.e. rabbit, mouse, goat or horse), and 
later collecting the immunised serum from the animal, and treating this according to 
procedures knowm in the art. Depending on the host species, various adjuvants may be used 
to enhance the immunological response against the injected polypeptide. Suitable adjuvants 

15 include, but are not limited to Freud's, aluminium hydroxide and SAF. Antibodies may also 
be monoclonal antibodies produced by hybridoma cells, phage display hbraries or other 
methodology. Monoclonal antibodies may be inter alia, human, rat or mouse derived. For the 
production of human monoclonal antibodies, hybridoma cells may be prepared by fusing 
spleen cells from an immunised animal, e.g. a mouse, with a tumour cell. Appropriately 

20 secreting hybridoma cells may thereafter be selected (Koehler & Milstein. Nature. 256:495- 
497, 1975; Cole et al, "Monoclonal antibodies and Cancer Therapy, Alan R Liss Inc, New 
York N.Y. pp 77-96). Rodent antibodies may be humanised using recombinant DNA 
technology according to techniques known in the art. Alternatively, chimeric antibodies, 
single chain antibodies. Fab fragments may also be developed against the polypeptides of the 

25 invention (Huse et al. Science. 256:1275-1281, 1989), using skills known in the art. 

The polynucleotides of the invention may be used in giene-probe methodology, with or 
without amplification to detect or diagnose the presence of C. albicans. This is particularly 
valuable in diagnosing clinical infections. Accordingly, the invention provides diagnostic kits 
for the detection of C. albicans ERGS or fragments thereof, and provides for the use of ERGS 

30 protein, polypeptide fragments thereof and/or antibodies raised there against as positive 
control. 
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DNA diagnostics is based on DNA/RNA hybridisation technology, i.e. the specific in 
vitro binding of complementary single-stranded nucleic acid with the formation of double- 
stranded nucleic acid. The DNA/DNA or DNA/RNA double strands formed are temied 
hybrids. To detect the presence of C. albicans in a bodily fluid such as blood, total nucleic 
5 acid is isolated from the lesi fluid saniple usinc siandard techniques and the presence oi 

C dilbicans ERGS nucleic acid in the sample is detected using for example detectably labelled 
probes comprising one or more of the polynucleotides of the invention. The probes can be 
shon, chemically synthesised oligonucleotide probes of a length of approximately 1 0 - 50 
nucleotides, or may be recombinantly expressed fragments of the ERGS gene of 

10 approximately 0.3-1 .5Kb in size. Single stranded oligonucleotide probes which are specific 
for C albicans are preferred. The probe can be provided with a suitable detectable reporter 
molecule label such as a radioisotope (P'% tritium, Cor S"), or a non-radioactive label such as 
digoxigenin or biotin, using techniques available to the person skilled in the art. Prior to the 
hybridisation reaction, all or any part of C. albicans ERGS DNA containing the sequence to 

15 which the probe can hybridise, present in the test sample is amplified using for example PGR 
(polymerase chain reaction) or LCR (ligase chain reaction). For the specific hybridisation 
reaction, the test nucleic acid and if necessary the probe DNA is converted into single strands 
by denaturation (heat or alkali) and then very specifically hybridised with each other under 
stringent conditions. Under appropriate conditions the gene probe only hybridises to 

20 complementary sequences of the DNA or RNA to be detected. The hybridisation and 
detection assay can be carried out in a number of different formats known to the person 
skilled in the art including, solid-phase hybridisation of target DNA or probe coupled to a 
solid support such as nitrocellulose or magnetic beads. The hybridisation complex can then 
be determined quantitatively, following removal of unbound probe or test nucleic acid, by 

25 way of the reporter molecule label (e.g. fluorescent or radioactive) employed. 

The test sensitivity of this single gene-probe diagnostic method can be increased by 
combination with DNA or RNA amplification techniques such as PGR or LCR. Using such 
amphfication techniques, the DNA to be detected can be multipled by up to 10^ 

There may only be 100-1000 organisms per ml of blood in association with Candida 

30 infections. Such small numbers of cells are easily detectable when combining the 

amplification and DNA-probe detection techniques offering the possibility of early detection 
of infection. 
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The following examples and figures describe and illustrate the invention. They are not 
intended to limit the scope of the invention in any way: 

Figures 1 and 2 sho\\' partial nucleotide sequences fiom two separate clones from a C. 
5 albiccms genomic library. Underlined regions were used to derive PCR primers, to generate a 
C. albicans ERGS specific probe. 

Figure 3 shows the nucleotide sequence of the C albicans gene encoding 
phosphomevalonate kinase. 1 ranslation sian (ATG) and siop (TAA) codons are highlighted. 

Figure 4 shows amino-acid sequence of C. albicans phosphomevalonate kinase. 
1 0 Fi gure 5 shows the sequences of the oligonucleotides used for the amplification of a 

kanamycin resistance cassette for disruption of ERGS in 5. cerevisiae. 

Figure 6 shows the sequences of the oligonucleotides used to amplify C albicans 
ERGS for cloning into pYES2, with restriction enzyme sites highlighted. 

Figure 7 shows the sequences of the oligonucleotides used to amplify C albicans 
15 ERGS for cloning into pT7#3.3 wqth restriction enzyme sites highlighted. 

Figure 8 shows the sequences of the oligonucleotides used to incorporate His^ residues 
into the N-or C-terminus of ERGS protein after expression in E. colt from pT7#3.3 
Convenient restriction enzyme sites are highlighted. 

20 EXAMPLES 

1. Cloning and partial sequence determination of two separate clones from a 
Candida albicans genomic library. 

Two separate cloned and sequenced nucleic acid sequences from a C. albicans library 
25 (see Figures 1 and 2) (SEQ ID NOs. 1 & 3) were found to have homology to that of 5. 

cerevisiae ERGS gene. The complement of the highlighted sequences in Figs. 1 and 2 were 
synthesised as oligonucleotides for use in the isolation of a clone containing the C. albicans 
ERGS gene (SEQ ID Nos. 2 & 4). 

30 2. Cloning and sequence determination of Candida albicans ERGS. 

Using the two oligonucleotide primers that are the complement of the sequences 
highlighted in Figures 1 & 2, the C. albicans ERGS gene was isolated as a plasmid clone 



98-110 

-13- 

from a library of C albicans genomic DNA in the yeast shuttle vector YEp24 using PGR. 
The C albicans ]ibrar\^ was maintained in E, coli and independent bacterial colonies were 
grown in single wells of each of 1 5 x 3S4-well microtitre plates. The properties of the librar>' 
plasniids are such that this gridded array contains approximately 2,5x the amount of DNA m 
5 the C. albicans genome . 

Small aljquots of cells from each of the wells were mixed lo produce a pool of cells 
thai were derived from all of the wells from a single plate. Similar pools were made for all of 
the rows and all of the columns from each of the plates. Samples of each of the pools of the 
cells for each complete plate were used in PGR reactions with oligonucleotide primers being 
10 the complement of the sequences highlighted in Figures 1 & 2 to identify plate(s) in the array 
carrying C albicans ERGS. Subsequent PGR reactions with pools of cells from rows of wells 
and columns of wells defined the specific well(s) carrying a clone of C albicans ERGS. 

The PGR reactions contained in a total volume of 0.05ml: 75mM Tris-HGl (pH 8.8 at 
20mM (NHO^SO,, 1.5mM MgGU, 0.01% Tween 20, 0.2mM of each of dATP, dGTP, 
15 dGTP and dTTP, 1 .25 units Taq DNA polymerase, lOOpmoles of each oligonucleotide primer 
and 0.005ml E. coli cell suspension. PGR reactions were incubated at 94*'G for 1 min then for 
30 cycles of the following: 94C for 1 min, 55C for 1 min, 72*'G for 1 min. PGR products 
were analysed by electrophoresis through agarose and visualised under UV light after staining 
with ethidium bromide. 

20 The complete sequence of the C albicans ERGS gene was determined on both strands 

using flanking sequence- or insert sequence-specific oligonucleotide primers. The fiill-length 
of the C. albicans ERGS gene is shown in Figure 3 and the protein translation in Figure 4. 

3. Generation of a heterozygous ERGS deletion strain of 5. cerevisiae 
25 Since PMK is an essential enzyme, only one allele of a diploid cell can be deleted 

without loss of viability. One ERGS gene diploid strain of S. cerevisiae (JK9-3daa; Kunz et 
al., Gell 73:585-596 (1993)) was disrupted using a kanamycin resistance cassette as described 
by Wach et al. (Yeast 10: 1793-1 SOS, 1994) using the protocol described therein with the 
oligonucleotides shown in Figure 5. Sporulation of the heterozygous diploid 
30 (ERG8/erg8::KanMX) yields only two viable spores that are both sensitive to kanamycin, 
showing ERGS to be essential, and the characteristic arrest phenotype for the two inviable 
spores. 
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4. Complementation of a S. cerevisiae ERG8 deletion with the cloned C albicans 
ERGS 

The heterozygous ERGS/erg8::KanA4X strain was transformed with the plasmid 
carrying the full-length C. albicans ERG8 gene within a fragment of C. albicans genomic 
5 DNA such that expression of the gene will depend on functionality of the C albican:^ 

promoter in the heterologous S. cerevisiae host. Surprisingly, the gene carried on the plasmid 
failed to complement the gene deletion as demonstrated by a failure to recover kanamycin- 
resistant haploid cells after sporulation. This was probably due to mappropriate expression of 
C. albicans ERGS in S. cerevisiae. 

1 0 To enable expression of C. albicans ERGS in 5. cerevisiae and to facilitate 

purification of ERGS protein as a result of over-expression in a suitable host, the C. albicans 
promoter was replaced by the efficient, inducible 5. cerevisiae GALl promoter. The C. 
albicans ERGS coding sequence was amplified by PGR using the oligonucleotides shown in 
Figure 6, which contain convenient restriction enzyme sites for cloning the product of PGR 

15 into an appropriate expression vector such as pyES2 (Invitrogen). The identity of the PCR- 
amplified gene cloned into pYES2 was confirmed by DNA sequencing. After transformation 
into the heterozygous ERG8/erg8::KanMX strain, the plasmid was able to complement the 
ergS::KanMX allele in S, cerevisiae since kanamycin-resistant haploid spores were viable on 
medium containing galactose but not glucose. This 5. cerevisiae strain is a useful source of 

20 biologically active C. albicans ERGS protein for assays in vitro. 

C, albicans ERGS can also be conveniently over-expressed in bacteria such as E. coli. 
The C albicans ERGS coding sequence is amplified by PGR using oligonucleotides 
containing convenient restriction sites for cloning into expression vectors such as pT7#3.3, It 
is particularly convenient if the initiation codon for ERGS is incorporated within one of the 

25 restriction sites. Oligonucleotides suitable for this are shown in Figure 7. Oligonucleotides 
may also incorporate extra sequences to encode a small "tag" that aids the subsequent 
purification of the protein. Such tags include for example, the "His^" tags which may be 
incorporated at the N- or C-terminus of ERGS using the oligonucleotides shown in Figure 8. 
Recombinantly expressed tagged ERGS protein can be conveniently purified by affinity 

30 chromatography purification methodology using commercially available purification kits (i.e 
Qiagen) (Borsig et al., Biochem. Biophys. Res. Commun. 240:586-5S9, 1997). 
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5. Hybridisation test of nucleic acid variations of specific nucleic acid 

sequences 

5.1 Hybridisation Test 

A method for detecting variant nucleic acids containing sequences related to specific 
5 ERGS sequence? such as natural alleles, is described. These variant nucleic acids niav be 
present in a variety of forms such as within plasmids or other like vehicles which may be 
fixed on to a hybridisation membrane, such as a nitrocellulose or nylon filter ready for 
detection using a labelled probe. Hybridisation assays can also be performed to identify 
variant sequences fi-om within genomic or cDNA libraries. Hybridisation technology is well 
1 0 advanced. It will be apparent to the person skilled in the art that the protocol described below 
is only one example of a hybridisation protocol suitable to identify ERGS variant sequences. 

5.2 Hybridisation probe 

Hybridisation probes may be generated from any fragment of DNA or RNA 
encoding the specific ERGS nucleic sequence of interest. Such fragments can be for example, 
15 restriction fragments isolated following restriction enzyme digestion of nucleic acid 

containing the ERGS nucleotide sequence or synthetic oligonucleotides specific for a region 
of the ERGS gene or a complementary sequence thereto. 

A hybridisation probe can be generated from a synthetic oligonucleotide or a 
dephosphorylated restriction fragment sequence by addition of a radioactive 5' phospate group 
20 fi-om [y-'^^PJATP by the action of T4 polynucleotide kinase. 20 pmoles of the oligonucleotide 
are added to a 20|il reaction containing lOOmM Tris, pH7.5 , lOmM MgCl2, 0.1 mM 

spermidine, 20mM dithiothreitol (DTT), 7.55|liM ATP, 55^Ci [y-^^P]ATP and 2.5u T4 
polynucleotide kinase (Pharmacia Biotechnology Ltd, Uppsala, Sweden). The reaction is 

incubated for 30 minutes at 37°C and then for 1 0 minutes at 70^C prior to use in 
25 hybridisation. Methods for the generation of hybridisation probes fi-om oligonucleotides or 
from DNA and RNA fragments (Chapters 1 1 and 1 0 respectively in Sambrook et al. ibid). A 
number of proprietary kits are also available for these procedures. 

5.3 Hybridisation conditions 

Filters containing the nucleic acid are pre-hybridised in 100ml of a solution 
30 containing 6x SSC, 0, 1 %SDS and 0.25% dried skimmed milk (MarveF^) at 65*^0 for a 

minimum of 1 hour in a suitable enclosed vessel. A proprietary hybridisation apparatus such 
as model HB-1 (Teclme Ltd) provides reproducible conditions for the experiment. 
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The pre-hybridisaiion solution is then replaced by 10ml of a probe solution 
containing 6xSSC, 0.1% SDS, 0.25% dried skimmed milk (e.g. MarveF^) and the 
oligonucleotide probe generated above. The filters are incubated in this solution for 5 minutes 
at 65''C before allowing the temperature to fall gradually to below SO^'C. The probe solution 
5 is then discarded and the filters washed in ] 00ml 6xSSC, 0.1% SDS at room temperature for 5 
minutes. Further washes are then made in fresh batches of the same solution at 30°C and then 
in 1 O^C increments up to 60°C for 5 minutes per wash. 

After washing, the filters are dried and used to expose an X-ray film such as 
Hyperfilm"^*^ MP (Amershani International) at -70°C in a light-tight film cassette using a fast 

10 tungstate intensifying screen to enhance the photographic image. The film is exposed for a 
suitable period (normally overnight) before developing to reveal the photographic image of 
the radio-active areas on the filters. Related nucleic acid sequences are identified by the 
presence of a photographic image compared to totally unrelated sequences which should not 
produce an image. Generally, related sequences will appear positive at the highest wash 

15 temperature (60'' C). However, related sequences may only show positive at the lower wash 
temperatures (50, 40 or SO^C). 

These results will also depend upon the nature of the probe used. Longer nucleic 
acid fi^agment probes will need to be hybridised for longer periods at high temperattire but 
may remain bound to related sequences at higher wash temperatures and/or at lower salt 

20 concentrations. Shorter, mixed or degenerate oligonucleotide probes may require less 

stringent washing conditions such as lower temperatures and/or higher Na"" concentrations. A 
discussion of the considerations for hybridisation protocols is provided in Sambrook et al. 
(Chapter 11). 

To prepare 20 x SSC,175.3 g of NaCl and 88.2 g of sodium citrate is dissolved in 
25 approximately 800ml of water, the pH is adjusted to 7.0 using ION solution of NaOH and the 
volume is adjusted to 1 litre with water, before autoclaving. 



30 
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Sequence ID Nos. 
SEQ ID NO. 1 

CCAATGGAAAAAGGAAAAGCCAGAAG.A^AAGCTCTGTTGTGTATGACCAGCTTAA 
5 TAGTGCCAATTTACAGTTTATGAAGGAATTGAGGGAAATGCGTGAAAAATACGAC 
TCAGACCCAGAGACTTATATTAAAGAGTTAGATCATTCTGTTGAGCCTTTGACTGT 
TGCGATTAAGAACATCAGAA.A.AGGGTTACAAGCATTAACACAAAAATCAGAGGT 
TCCAATTGAACCTGATGTCCAAACCCAGTTGTTGGACCGTTGTCAAGAGATTCCT 
GGTTGTGTTGGCGGTGTGGTTCCAGGTGCTGGTGGATACGATGCAATAGCTGTAT 
1 0 TAGTGTTGGAAAATCAAGTGGGAAATTTTA.AGCAGAAAACTCTTGAAAATCCAGA 
TTATTTTCATAATGTTTACTGGGTTGATTTGGAAGAGCAAACAGAAGGTGTACTTG 
AAGAAAAACCAGAAGACTATATAGGTTTATAAAATATCACTAGGATATGTCTACA 
AGGTGATTTCGATTAGATTTTCTGCTACCCGTTTTAACATATTTTACTTCAA 

15 SEQ ID NO. 2 

GCTGGTGGATACGATGCAATA 

SEQ ID NO. 3 

ATGTACATCTTTCATGTTTGAATTCCCAGTTACTTGCAATCAATTTTTTCAACTCTG 

20 TGGGGAACTTCTCAGGATCACTTTCTAGAACCTGAAACACGTCATTTATCAAAGC 

TGGCTGAAATCTTCTATATACAATCAGACCATAAATTGCAGTTGCAACATCAAAC 

CCAGATCCTATCTTTTTTTGGGCATAACAATGTGCAATCTGTGCAACGTTGTGCAA 

AATATCTTTATTCGTACTGATAACATTGGGGATAAAATGGGATAATAAACTTGTG 

GCAACAACTGACACTAATCCTGCCGATGAACCTAATCCGGTCTTTTCCACTTCGGT 

25 AATGGCACGAGAATGGTAAAGAAAAGTTTTTTCTCCATTCGAGGATGTCTTGGTT 

TCAGTATCTTCTTGTGAATGATATCCAGGGTCCGAGTAAATAATGATTTCAAGATC 

AAATGCTTCGGTCGGTTGAATATAAGCTAAAAACCGATGGATATAGTTGCCTCTA 

AAAATGGGATTTATGCGTGACTGNACTTCTTTGGGTTTTCNGTAATTGATGATATG 
TGATANTCCCATTCCCCGGTT 

30 

SEQ ID NO. 4 

GGGGATAAAATGGGATAATAAACTT 
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SEQ ID NO. 5 

GTGGAAAAAAAAGACAGAACAGTAGATTCCAACTTCAGAATATTCATTCAGATCT 
GAACATTTCTTTTTCTCCGATCATCAATTGGCAATGTCAAAAGCATTTAGTGCACC 
: TGGAAAAGCATTTCTTGCTGGTGGATATTTGGTTCTTGAGCCAATTTATGAIGCTT 
ATGTGACAGCATTGTCATCACGAATGCATGCAGTTATAACACCAAAAGGAACCAG 
TTTGAAAGAATCTAGAATCAA.AATTTCTTCACCCCAATTTGCAAACGGAGAATGG 
GAATATCACATATCATCAAATACAGAGAAGCCCAGAGAAGTTCAGTCACGCATA. 
AATCCATTTTTAGAGGCAACTATATTCATCGTTTTAGCTTATATTCAACCGACCGA 

1 0 AGCATTTGATCTTGAAATCATCATTTACTCAGACCCTGGATATCATTCACAAGAA 
GATACTGAAACCAAGACATCCTCGAATGGAGAAAAAACATTTCTTTACCATTCTC 
GTGCCATTACCGAAGTGGAAAAGACCGGATTAGGTTCATCGGCAGGATTAGTGTC 
AGTTGTTGCCACAAGTTTATTATCCCATTTTATCCCCAATGTTATCAGTACGAATA 
AAGATATTTTGCACAACGTTGCACAGATTGCACATTGTTATGCCCAAAAAAAGAT 

1 5 AGGATCTGGGTTTGATGTTGCAACTGCAATTTATGGTCTGATTGTATATAGAAGAT 
TTCAGCCAGCTTTGATAAATGACGTGTTTCAGGTTCTAGAAAGTGATCCTGAGAA 
GTTCCCCACAGAGTTGAAA.AAATTGATTGAAAGTAACTGGGAATTCAAACATGAA 
AGATGTACATTACCATACGGAATCAAGTTATTAATGGGTGACGTCAAGGGTGGCT 
CAGAAACACCCAAATTGGTATCACGAGTACTCCAATGGAAAAAGGAAAAGCCAG 

20 AAGAAAGCTCTGTTGTGTATGACCAGCTTAATAGTGCCAATTTACAGTTTATGAA 
GGAATTGAGGGAAATGCGTGAAAAATACGACTCAGACCCAGAGACTTATATTAA 
AGAGTTAGATCATTCTGTTGAGCCTTTGACTGTTGCGATTAAGAACATCAGAAAA 
GGGTTACAAGCATTAACACAAAAATCAGAGGTTCCAATTGAACCTGATGTCCAAA 
CCCAGTTGTTGGACCGTTGTCAAGAGATTCCTGGTTGTGTTGGTGGTGTGGTTCCA 

25 GGTGCTGGTGGATACGATGCAATAGCTGTATTAGTGTTGGAAAATCAAGTGGGAA 
ATTTTAAGCAGAAAACTCTTGAAAATCCAGATTATTTTCATAATGTTTACTGGGTT 
GATTTGGAAGAGCAAACAGAAGGTGTACTTGAAGAAAAACCAGAAGACTATATA 
GGTTTATAAAATATCACTGGGATATGTCTACAAGGTGTTTTCGATTAGAGTTTTTG 
ATCCCCATTTTAACATATTTTACTTCAATCTTACACTTTATCCTTTTAAGTAGGTAT 

30 GTGTAGGGAAAGAGCCTGATCTTCATAAACCGTTGCAAACTAATTGATTATATTT 
TCTATTGTAAATTTCATATGCAGGAAATAGCTTATTCGACAAATTATTTATTTTCG 
TCTCGTTCTGGTCCAAGTACCCCAGAGACGAAATAACTGACAACACGCAGGGCTG 
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GGTTGGCATTTTCGTCACACGATTATTATTAATGGTAACAAAAAAAGGGGRKATG 
CCCGTGGTCGATACACAAATATTTATGATATACTTTCCATATTTTTTTTTT 

SEO ID NO. 6 

5 Al Gl CAAAAGCATT7 AGTGCACCTGGAAAAGCATTTCTTGCTGG7GGATATTTGG 
11 Cl'l GAGCCAATTTATGATGCTTATGTGACAGCATTGTCATCACGAATGCATGCA 
GTTATAACACCAAAAGGAACCAGTTTGAAAGAATCTAG.^TCAAAATTTCTTCAC 
CCCAATTTGCAAACGGAGAATGGGAATATCACATATCATCAAATACAGAGAAGC 
CCAGAGAAGTTCAGTCACGCATAAATCCATTTTTAGAGGCAACTATATTCATCGT 

1 0 TTTAGCTTATATTCAACCGACCGAAGCATTTGATCTTGAAATCATCATTTACTCAG 
ACCCTGGATATCATTCACAAGAAGATACTGAAACCAAGACATCCTCGAATGGAG 
AAAAAACATTTCTTTACCATTCTCGTGCCATTACCGAAGTGGAAAAGACCGGATT 
AGGTTCATCGGCAGGATTAGTGTCAGTTGTTGCCACAAGTTTATTATCCCATTTTA 
TCCCCAATGTTATCAGTACGAATAAAGATATTTTGCACAACGTTGCACAGATTGC 

1 5 ACATTGTTATGCCCAAAAAAAGATAGGATCTGGGTTTGATGTTGCAACTGCAATT 
TATGGTCTGATTGTATATAGAAGATTTCAGCCAGCTTTGATAAATGACGTGTTTCA 
GGTTCTAGAAAGTGATCCTGAGAAGTTCCCCACAGAGTTGAAAAAATTGATTGAA 
AGTAACTGGGAATTCAAACATGAAAGATGTACATTACCATACGGAATCAAGTTAT 
TAATGGGTGACGTCAAGGGTGGCTCAGAAACACCCAAATTGGTATCACGAGTACT 

20 CCAATGGAAAAAGGAAAAGCCAGAAGAAAGCTCTGTTGTGTATGACCAGCTTAA 
TAGTGCCAATTTACAGTTTATGAAGGAATTGAGGGAAATGCGTGAAAAATACGAC 
TCAGACCCAGAGACTTATATTAAAGAGTTAGATCATTCTGTTGAGCCTTTGACTGT 
TGCGATTAAGAACATCAGAAAAGGGTTACAAGCATTAACACAAAAATCAGAGGT 
TCCAATTGAACCTGATGTCCAAACCCAGTTGTTGGACCGTTGTCAAGAGATTCCT 

25 GGTTGTGTTGGTGGTGTGGTTCCAGGTGCTGGTGGATACGATGCAATAGCTGTAT 
TAGTGTTGGAAAATCAAGTGGGAAATTTTAAGCAGAAAACTCTTGAAAATCCAGA 
TTATTTTCATAATGTTTACTGGGTTGATTTGGAAGAGCAAACAGAAGGTGTACTTG 
AAGAAAAACCAGAAGACTATATAGGTTTATAA 

30 SEQIDNO. 7 

MSKAFSAPGKAFLAGGYLVLEPIYDAYVTALSSRMHAVITPKGTSLKESRIKISSPQFA 
NGEWEYHISSNTEKPREVQSRINPFLEATIFIVLAYIQPTEAFDLEIIIYSDPGYHSQEDT 
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ETKTSSNGEKTFLyHSRAITfEVEKTGLGSskGLVSVVATsVLSHFIPhrVISTNKDILHNV 

• ■ ■ . ij - . :ji ' . - ■ ■ \ . - ' 

A01AHCYA0KX]GSGFDVATA]YGLIVYRRF0PALJKT»VFQVLESDPEKPPTELKKLIE 
SNAVTEKJiERCTLPYGIKLLJMGDVKGGSETPKLA^SRVLOWKKEKPEESSVVYDOLNS 
ANLOFMKELREMREKYDSDPETYIKELDHSVEPLTVAIKNIRKGL-OALTOKSEVPIEP 
5 DVOTOLLDRCOEIPGrVGGVVPGAGGYDA]AVLYaENOVGNFKOKTLENPDYFH>A' 
YW\'T)LEEQTEGVLEEKPEDYIGL 
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Figure 1 . 



CCAATGGAAAAAGGAAAAGCCAGAAGA.AAGCTCTGTTGTGTATGACCAGCTTAA 

TAGTGCCAATTTACAGTTTATGAAGGAATTGAGGG.AAATGCGTGAAA.AATACGAr 

TCAGACCCAGAGACTTATATTAAAGAGTTAGATCATTCTGTTGAGCCTTTGACTGT 

TGCGATTAAG.AACATCAGAAAAGGGTTACAAGCATTAACACAAAAATCAGAGGT 

TCCAATTGAACCTGATGTCCAAACCCAGTTGTTGGACCGTTGTCAAGAGATTCCT 

GGTIGTGTTGGCGGTGTGGTTrCAGGT GrTGGTGGATACGATGrAATA GrTnTA 

TTAGTGTTGGAAAATCAAGTGGGAAATTTTAAGCAGAAAACTCTTGAAAATCCAG 

ATTATTTTCATAATGTTTACTGGGTTGATTTGGAAGAGCAAACAGAAGGTGTACTT 

GAAGAAAAACCAGAAGACTATATAGGTTTATAAAATATCACTAGGATATGTCTAC 

AAGGTGATTTCGATTAGATTTTCTGCTACCCGTTTTAACATATTTTACTTCAA 
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Figure 2. 

ATGTACATCTTTCATGTTTGAATTCCCAGTTACTTGCAATCAATTTTTTCAACTCTG 

TGGGGA,ACTTCTCAGGATCACTTTCTAGAACCTGAAACACGTCATTTATCAAAGC 

TGGCTGAAAJCTTCTATATACA.ATCAGACCATAA.ATTGCAGTTGCAACATCAAAC 

CCAGATCCTATCTTTTTTTGGGCATAACAATGTGCAATCTGTGCAACGTTGTGCAA 

AATATCTTTATTCGTACTGATAACATT GGGGATAAAATGGGATAATAAACTT GT 

GGCAACAACTGACACTAATCCTGCCGATGAACCTAATCCGGTCTTTTCCACTTCG 

GTAATGGCACGAGAATGGTAAAGAAAAGTTTTTTCTCCATTCGAGGATGTCTTGG 

TTTCAGTATCTTCTTGTGAATGATATCCAGGGTCCGAGTAAATAATGATTTCAAGA 

TCAAATGCTTCGGTCGGTTGAATATAAGCTAAAAACCGATGGATATAGTTGCCTC 

TAAAAATGGGATTTATGCGTGACTGNACTTCTTTGGGTTTTCNGTAATTGATGATA 

TGTGATANTCCCATTCCCCGGTT 
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Figure 3. 

CTTTTTCTCCGATCATCA.^.TTGGCAM1GTCAAAAGCATTTAGTGCACCTC 

TGCTGGTGGATATTTGGTTCTTGAGCCA.2LTTTATGATGCTTATGTGACAGCATTGTCATCACG 

AATGCATG-CAGTTATA^^.CACCA-AAAGGAAGGAGTTTGA^JiGAA 

ACCCCAJlTTTGCAJ^JiCGGAGA.^.TGGGA^.TATGAGATATGATCAJ^^^^ 

AGTTCAGTCACGCA.TAAATCGATTTTTAGAGGCAACTATATTCATCGTTTTAGCTTATATTCA 
ACCGACCGAAGCATTTGATCTTGAAATCATCATTTACTCAGACCCTGGATATCATTCACAAGA 
AGATACTGAAACCAAGACATCCTCGAATGGAGAAAAAACATTTCTTTACCATTCTCGTGCCAT 
TAGGGAAGTGGAAAAGACCGGATTAGGTTCATCGGCAGGATTAGTGTCAGTTGTTGCCACAAG 
TTTATTATCCCATTTTATGCCCAATGTTATCAGTACGAATAAAGATATTTTGCACAACGTTGC 
ACAGATTGCACATTGTTATGCCCAAAAAAAGATAGGATCTGGGTTTGATGTTGCAACTGCAAT 
TTATGGTCTGATTGTATATAGAAGATTTCAGCCAGCTTTGATAAATGACGTGTTTCAGGTTCT 
AGAAAGTGATCCTGAGAAGTTCCCCACAGAGTTGAAAAAATTGATTGAAAGTAACTGGGAATT 
CAAACATGAAAGATGTACATTACCATACGGAATCAAGTTATTAATGGGTGACGTCAAGGGTGG 
CTCAGAAACACCCAAATTGGTATCACGAGTACTCCAATGGAAAAAGGAAAAGCCAGAAGAAAG 
CTCTGTTGTGTATGACCAGCTTAATAGTGCCAATTTACAGTTTATGAAGGAATTGAGGGAAAT 
GCGTGAAAAATACGACTCAGACCCAGAGACTTATATTAAAGAGTTAGATCATTCTGTTGAGCC 
TTTGACTGTTGCGATTAAGAACATCAGAAAAGGGTTACAAGCATTAACACAAAAATCAGAGGT 
TCCAATTGAACCTGATGTCC7VAACCCAGTTGTTGGAGCGTTGTCAAGAGATTCCTGGTTGTGT 
TGGTGGTGTGGTTCCAGGTGCTGGTGGATACGATGCAATAGCTGTATTAGTGTTGGAAAATCA 
AGTGGGAT^TTTTAAGCAGAAAACTCTTGAAAATCCAGATTATTTTCATAATGTTTACTGGGT 
TGATTTGGAAGAGCAAACAGAAGGTGTACTTGAAGAAAAACCAGAAGACTATATAGGTTTATA 
AAATATCACTGGGATATGTCTACAAGGTGTTTTCGATTAGAGTTTTTGATCCCCATTTTAACA 
TATTTTACTTCAATCTTACAGTTTATCCTTTTAAGTAGGTATGTGTAGGGAAAGAGCCTGATC 
TTCATAAACCGTTGCAAACTAATTGATTATATTTTCTATTGTAAATTTCATATGCAGGAAATA 
GCTTATTCGACAAATTATTTATTTTCGTCTCGTTCTGGTCCT^GTACCCCAGAGACGAAATAA 
GTGACAACACGCAGGGCTGGGTTGGCATTTTCGTCACACGATTATTATTAATGGTAACAAAAA 
AAGGGGRKATGCCCGTGGTCGATACACAAATATTTATGATATACTTTCCATATTTTTTTTTT 
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Figure 4. 

MSK-APSAPGKAFLAGGYLVLEPmDAYVTALSSR-MHAVlTPKGTSLKESRIKlSSPOFA 
NGEWIEYHJSSNTEKPRPVOSRmWLEATlFIVLAYlOPTtAFDLElllYSDPGYHSOEDl 
ETKTSSNGEKTFLYHSRAITEVEKTGLGSSAGLVSVVATSLLSHFIPNVISTNKDILHNV 
A0]-AHCYA0KKJGSGFDV\4TA]YGL]VTRJU^QPALITnTDVFQVLESDPEKJPTELKKL1E 

smveekj^erctlpyg]kllmgdvkggsetpklvsrvlowkkekpeessvvydqlns 
anlqfmkelpiemrekydsdpetyikeldhsvepltvaiknirkglqaltoksevpiep 
dvqtqlldrcqeipgcvggvvpgaggydaiavlvlenqvgnfkqktlenpdyfhnv 
ywvdleeqtegvleekpedyigl 
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Figure 5. 

1 . AAATGTCAGAGTTGAGAGCCTTCAGTGCCCCAGGGAAAGCGTTACTAGCTGCA 
GCTGAAGCTTCGTACGC 

2. AGTTATTTATCAAGATA.AGTTTCCGGATCTTTTTCTTTCCTAACACCCCAGGCA 
TAGGCCACTAGTGGATCTG 
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Figure 6. 

1 . CCCAAGCTTGGCAATGTCAAAAGCATTTAGTGC 

2. CCGCTCGAGATTTTATAAACCTATATAGTCTTCTGG 
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Figure 7. 

1 . CATATGTCAAAAGCATTTAGTGCACC 

2. GGATCCTTATAAACCTATATAGTCTTCTGG 
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Figure 8. 

1 . CATATGCACCATCACCACCATCACTCAAAAGCATTTAGTGCACC 

2, GGATCCTTAGTGATGGTGGTGATGGTGTAAACCTATATAGTCTTCTGG 
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